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PROBLEM TO BE SOLVED: To provide a user programmable environment in 
the control of a transmission amount. 

SOLUTION: From a receiver 302 to a transmitter 301, a transmission amount 
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transmitter 301 which receives it, based on both of the script detected in a 
control method interpretation means 31 and the information of a transmission 
line detected in a transmission line observation means 33, the transmission 
amount and the method of encoding to be sent to the receiver 302 are decided 
and transmitted. 
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CLAIMS 



[Claim(s)] 



[Claim 1] The transceiver approach which is the approach of encoding, 
transmitting and receiving either an image or voice at least based on the 
transmission control information which described the approach of coding, and is 
characterized by the ability to change said transmission control information. 
[Claim 2] The transceiver approach according to claim 1 that transmission 
control information is characterized by being first sent from a receiving side. 
[Claim 3] The transceiver approach according to claim 1 by which it is describing 
[ the information about at least one or more control approaches ]-among key 
frames which should be transmitted characterized [ insertion spacing of the 
frame encoded based on the information on a transmission rate, the playback 
frame rate by the side of decode, and either of the transmission loss in a 
quantization step, the frame rate at the time of coding, the size of an image 
frame, and a frame, insertion spacing of the frame by which interframe coding 
was carried out, and ] at least at transmission-control information. 
[Claim 4] A key frame is the transceiver approach according to claim 3 
characterized by being expressed with the priority which shows the significance 
of decode processing for every image frame, and being able to carry out priority 
processing in said priority. 

[Claim 5] The transceiver approach according to claim 1 to 4 characterized by 
changing said transmission control information using the information which 



described the classification of the contents contained in an image or voice data 
at least. 

[Claim 6] The sending set which realizes the transceiver approach given in any 
of claims 1-5 they are. 

[Claim 7] The receiving set for realizing the transceiver approach given in any of 
claims 1-5 they are. 

[Claim 8] The transceiver system equipped with at least one set of at least one 
set of the sending set for realizing the transceiver approach given in any of 
claims 1-5 they are, and the receiving set for realizing the transceiver approach 
of a publication to any of **** 1-5. 

[Claim 9] The program for realizing the transceiver approach given in any of 
claims 1-5 they are. 



DETAILED DESCRIPTION 

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the program which realizes the 
transceiver approach, the transmitter-receiver, and them which encode efficient, 



and transmit and receive data, such as voice, especially about the transceiver 

approach of an image or voice. 

[0002] 

[Description of the Prior Art] In the conventional transmission rate control system, 
a certain protocol is decided and the amount of transmissions is controlled by 
adjusting a quantization step and a frame rate based on a coding result 
according to the protocol. 

[0003] Moreover, there is a method of transmitting the frame rate in each image 
size which can be maximum displayed, and the maximum ready-for-receiving 
ability bit rate to a server side as an approach of adjusting the amount of 
transmissions according to the capacity of a receiving side (ITU-T 
recommendation H.245). 

[0004] By using this conventional method, it is possible to fit a transmission rate 
to the band (transmission band) of the network which can be used in order to 
transmit and receive image voice data. 
[0005] 

[Problem(s) to be Solved by the Invention] However, the approach usually 
beforehand prepared by the server side is used for the method of adjusting the 
amount of signs of image voice data. 

[0006] Therefore, it is not easy if you want to fix a frame rate to a certain 



constant value according to a transmission band, to restrict the maximum of a 
frame rate in consideration of the engine performance of a receiving-side 
terminal in to reflect a demand of the user of making it want to lower image 
quality from a certain level ****. By the conventional approach, while the user 
supervised the condition of transmission, the transmission approach was 
controlled by the manual operation to fill the demand of a receiving side. 
[0007] That is, by this approach, although rate control reflecting the capacity of a 
receiving-side terminal can be performed H.245 by transmitting the maximum 
frame rate to a transmitting side from a receiving side, when the demand of a 
receiving side changes according to a transmission band, correspondence is 
impossible. 

[0008] Furthermore, although an image frame is thinned out in the conventional 

rate control system in order to adjust the amount of signs, this takes and spills an 

important scene and it may not be transmitted to a receiving side. 

[0009] This invention aims at offering the program for realizing the transceiver 

approach, the transmitter-receiver, and them which solve the above-mentioned 

technical problem. 

[0010] 

[Means for Solving the Problem] Since the above-mentioned purpose is attained, 
based on the transmission control information which described the approach of 



coding, at least, this invention is the approach of encoding either an image or 
voice, and transmitting and receiving, is enabling modification of transmission 
control information, and can realize the transmission approach according to the 
purpose or an application. 

[0011] And transmission control information is the transceiver approach 
characterized by being first sent from a receiving side. 

[0012] Moreover, based on the information on a transmission rate, the playback 
frame rate by the side of decode, and either of the transmission loss, the 
information about at least one or more control approaches is described at least 
by transmission control information among a quantization step, the frame rate at 
the time of coding, the size of an image frame, insertion spacing of the frame 
encoded in the frame, insertion spacing of the frame by which interframe coding 
was carried out, and the key frame that should be transmitted. 
[0013] Moreover, a key frame is expressed with the priority which shows the 
significance of decode processing for every image frame, and can carry out 
priority processing in said priority. 

[0014] Furthermore, transmission control information is changed using the 
information which described the classification of the contents contained in an 
image or voice data. 
[0015] 



[Embodiment of the Invention] Hereafter, the gestalt of operation of this invention 
is explained, referring to a drawing. 

[0016] Drawing 1 is the schematic diagram of the image voice sending set in the 
gestalt of operation of this invention. 

[0017] In this drawing, a transceiver means 10 to transmit and receive 
information is a means to transmit and receive the information on LAN, a modem, 
etc. As communication environment, the transmission environment stated by 
drawing 1 is taken into consideration. 

[0018] The coding means 11 is a means for encoding image voice data. For 
example, you may be H.261, H.263, MPEG1/2/4, G.711, and a standardized 
coding method that is called G.729, and that may not be right. 
[0019] The amount control means 12 of transmissions is a means to adjust the 
amount of signs of image voice data by selection of the frame inputted into a 
coding means, and decision of a quantization step. 

[0020] The transmission-line observation means 13 is a means for offering the 
information for presuming the condition of a transmission band or a receiving 
side using the feedback information from a receiving side, and adjusting the 
amount of signs of image voice data in the amount control means 12 of 
transmissions. The information on the loss information on a packet, delay, or a 
jitter and the information of the output situation (playback frame rate by the side 



of decode) of the image voice of a receiving side are included in feedback 
information. Suppose that the condition of transmission of the information on the 
loss information on a packet, the delay, and the jitter which are contained in 
these feedback information, and the image speech information presumed from 
the output situation of the image voice of a receiving side is called a transmission 
condition. 

[0021] The control approach interpretation means 14 shown in drawing 1 is a 
means to interpret the control information transmitted from a receiving side. 
Control information is described by script languages, such as JAVA and Perl. 
The information described is the value of a parameter required for adjustment of 
the amounts of signs, such as the fluctuation range of a frame rate, the 
fluctuation range of a quantization step, image size, insertion spacing of the 
frame encoded in the frame, and insertion spacing of the frame by which 
interframe coding was carried out, and is described that the value of these 
parameters changes according to a transmission condition (for details, 
explanation of drawing 4 describes). Control information may be inputted from 
the local storage of a transmitting side. 

[0022] The terminal control means 15 is a means to control each [ these ] means. 
[0023] Drawing 2 is the schematic diagram of the image voice receiving set in 
the gestalt of operation of this invention. 



[0024] In this drawing, a transceiver means 20 to transmit and receive 
information is a means to transmit and receive the information on LAN, a modem, 
etc. An environment (for example, Internet) in which the band which can be used 
carries out time variation as communication environment is taken into 
consideration. 

[0025] The decryption means 21 shown in drawing 2 is a means for decrypting 
image voice data. For example, you may be H.261, H.263, MPEG1/2/4, G.711, 
and a standardized decryption method that is called G.729, and that may not be 
right. 

[0026] The output means 22 is a means which carries out the playback output of 
the decrypted image voice data. 

[0027] The transmission-line observation means 23 is a means to feed back a 
receiving situation to a transmitting side. The loss information on a packet, delay 
information, and the information of the image voice output situation of a receiving 
side are included in the information to feed back. I hope that RTCP 
(RTPControlProtocol, IETFRFC1889) which is the standardized format as a 
format of feedback information may be used, and that is not right. 
[0028] The control approach generation means 24 is a means to generate a 
change demand of the value of a parameter required for adjustment of the 
amounts of signs, such as the fluctuation range of a frame rate, the fluctuation 



range of a quantization step, image size, insertion spacing of the frame encoded 
in the frame, and insertion spacing of the frame by which interframe coding was 
carried out, according to a transmission condition. The change demand of a 
parameter is described by script languages, such as JAVA and Perl. 
[0029] The terminal control means 25 is a means to control each [ these ] means. 
[0030] Drawing 3 is drawing showing the information transmitted between the 
image voice sending set 301 and the image voice receiving set 302. 
[0031] The control information transmitted to the control approach interpretation 
means 31 from the control approach generation means 30 of drawing 3 is a 
script for the amount control of transmissions. It is described by the script that a 
parameter is determined according to a transmission condition (for details, 
explanation of drawing 4 describes). It is a time of modification of for example, 
the time of connection or the amount control approach of transmissions being 
demanded by the user that this control information is transmitted. 
[0032] The data transmitted to the transmission-line observation means 33 of a 
transmitting side from the transmission-line observation means 32 of the 
receiving side of drawing 3 are the information showing the receive state of 
image voice data. The information and delay information on the packet loss of 
image voice data may be transmitted using a format of the criterion represented 
by RTCP, and that may not be right. 



[0033] The data transmitted to the decryption means 35 from the coding means 
34 of drawing 3 are image voice data. 

[0034] A sequence number can be added to the image packetized voice 
transmitted, and the rate of a loss of a packet can be known by investigating lack 
of a number. A sequence number hopes that RTP (Real-TimeTransportProtoc ol 
and IETFRFC1889) which is a standard format may be used, and that is not right. 
[0035] Drawing 4 is the example which described the script which controls the 
transmission approach transmitted to a transmitting side from a receiving side by 
Perl. In case a receiving side connects with a transmitting side, it transmits this 
script to a transmitting side. A transmitting side chooses a parameter according 
to the transmission condition observed, and adjusts the amount of transmissions. 
When the transmission band observed with the amount observation means of 
transmissions of a transmitting side is more than 128Kbps in the case of this 
example, a quantization step ($qp) is fixed by 8, and the amount of 
transmissions is adjusted so that the maximum of a frame rate may be set to 10 
($max_frame_rate). Moreover, image size ($image_size) has chosen CIF (step 
41). 

[0036] When a transmission band is under 32 or more Kbpsesl 28Kbps, the 
fluctuation range of a quantization step was fixed in 8 to 24 (in $minqp, the 
minimum value of a quantization step and $maxqp express the maximum of a 



quantization step), and it is specified that the priority of a frame discards a frame 
smaller than 3 (step 42). 

[0037] When a transmission band is 32 or less Kbpses, a frame rate 
($frame_rate) is fixed to 3 and insertion spacing ($i_frame) of the coding frame in 
a frame is made into one frame at five frames (step 43). 

[0038] A script is transmitted from a receiving side at the time of connection of a 
transmitting side and a receiving side, as stated also in advance. 
[0039] Moreover, also when a demand is changed during connection, you may 
transmit. Furthermore, a script may be managed by the transmitting side and 
may be chosen by the receiving side. Although it is described by the gestalt of 
this operation that the value of each parameter is changed according to a 
transmission band, actuation of coding according to the playback frame rate in 
not only a transmission band but the rate of a packet loss or a receiving side may 
be described. 

[0040] Drawing 5 is drawing explaining the priority of a frame. The priority of a 
frame shall already be given to contents. 

[0041] Like drawing 5 , the priority shall be beforehand given to the contents of 
an image or voice for every frame, and a high priority shall be given to an 
important frame at them. In the script mentioned above, if specified that what 
has the low priority of a frame is discarded, the amount control means of 



transmissions will choose based on the priority of a frame. 
[0042] Drawing 6 is drawing explaining how to determine a parameter for the 
information which described the classification of contents to perform the amount 
control of transmissions. 

[0043] In a receiving side, based on the classification information (this example 
<movie>) described to contents, the script suitable for contents is read from a 
record medium, and it transmits to a transmitting side. For example, a movie 
performs description which gives priority to a frame number for image quality for 
sports from a frame number. 

[0044] Drawing 7 is a flow chart which shows the procedure of the control 
approach interpretation means. 

[0045] First, the default of each parameter is set up (step 71) and then the 
information on a transmission condition (this example transmission band) is 
acquired from the amount observation means 13 of transmissions (step 72). It 
changes into the value which wishes for a default according to a transmission 
band (step 73). 

[0046] Actuation of this part is determined by the script sent from the receiving 
side. The script shown in drawing 4 shall be used in this flow chart. Finally a 
parameter is returned to the amount control means of transmissions at a setup 
of delivery and a default. 



[0047] Drawing 8 is a flow chart which shows actuation of the amount control 

means of transmissions at the time of transmitting image data. 

[0048] Before explaining actuation, the semantics of each variable in drawing is 

explained. 

[0049] FramePri expresses the threshold of the frame priority specified from the 
receiving side. 

[0050] MaxFrameRate expresses the maximum of the frame rate specified from 
the receiving side. 

[0051] OriginalFR expresses the frame rate of contents. 

[0052] FrameRate expresses the frame rate specified from the receiving side. 

[0053] FrameCounter is a counter for thinning out a frame. 

[0054] Iframe is insertion spacing of the coding frame in a frame specified from 

the receiving side. 

[0055] l-FrameCounter is a counter for insertion of the coding frame in a frame. 
[0056] QP expresses a quantization step and MaxQP and MinQP express the 
maximum and the minimum value of a quantization step which were specified 
from the receiving side, respectively. 

[0057] Then, actuation of the amount control means 12 of transmissions is 
explained. 

[0058] Actuation is divided roughly and it consists of a part (steps 807-810) 



which determines what kind of coding processing is performed to the frame 

which encoding whether the frame taken out from contents is encoded with the 

part (steps 801-806) to judge determined. 

[0059] First, a frame is taken out from image data (step 801). 

[0060] In [ which is not fixed ] judging [ whether it is frame rate regularity and ] 

based on a script (step 802), it judges whether it is that by which the frame 

should be discarded based on the priority of the frame, and a current frame rate 

(step 803). 

[0061] When a frame rate is fixed, a frame is thinned out so that it may become 
the appointed frame rate (step 804). 

[0062] Next, the residue of the coded data which is not transmitted [ which 

remains in the buffer for holding until the coded data outputted from a coding 

means is transmitted ] is investigated. The frame is discarded when there are 

more data residues in a buffer than a certain threshold (step 805). 

[0063] By this actuation, exceeding the amount of data which can transmit the 

amount of signs is lost. When the data residue in a buffer is less than the 

threshold, it is determined that the frame will be encoded by it. 

[0064] The frame count variable which is used in frame rate immobilization is 

returned to 0 (step 806), and it determines [ which encodes the frame in a frame 

/ or or ] whether to carry out interframe coding (step 807). 



[0065] Next, based on a script, it judges whether a quantization step is 
immobilization (step 808). 

[0066] in not being immobilization, it determines per GOB (GroupOfBlocks) that 
a quantization step will not exceed the maximum of the specified quantization 
step, and the minimum value, and passes a coding means (for example, the 
decision approach of a quantization step - "VIDEO CODEC TEST MODEL, 
TMN5" 3.7 Buffer Regulation, Telenor Reserch, and 1995). (step 809) 
[0067] When a quantization step is immobilization, the frame is encoded by the 
fixed quantization step (step 810). 

[0068] If coding of a frame is completed, it will return to step 801. 
[0069] 

[Effect of the Invention] according to [ as explained above ] this approach - 
control of the amount of transmissions — setting — a user — it becomes possible 
to offer a programmable environment and the change of the control approach of 
the amount of transmissions according to fluctuation of contents or a 
transmission condition can be determined based on a transmission condition. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] The outline block diagram of the image voice sending set in the 
example of this invention 

[Drawing 2] The outline block diagram of the image voice receiving set in the 
example of this invention 

[Drawing 3] Drawing showing the transmitting example of the information 
transmitted between a receiving side and a transmitting side 
[Drawing 4] Drawing showing the example of the script transmitted to a 
transmitting side from a receiving side 

[Drawing 5] Drawing showing the example of the frame selection by the frame 
priority 

[Drawing 6] Drawing showing the decision approach of a script based on the tag 
information on contents 

[Drawing 7] The flow chart which shows actuation of the control approach 
interpretation means 

[Drawing 8] The flow chart which shows actuation of the amount control means 
of transmissions 
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